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PATENT APPLICATION 



IMAGE ANALYSIS OF THE GOLGI COMPLEX 



BACKGROTTND OF THE INVENTION 

The present invention relates to methods and apparatus for characterizing a cell's 
10 condition based upon the state of its Golgi organelle. More specifically, the invention 
relates to image analysis methods and apparatus that rapidly characterize a ceU based upon 
phenotypic charactCTistics of its Golgi conq)lex. 

The Golgi complex is a central and complex organelle witiun eukaiyotic cells. It is 
involved in the sorting, modifying, and transportmg of cellular molecules. It plays 
1 5 important roles in intracellular processes including endocytosis, exocytosis, and transport 
between ceUular organelles. A detailed discussion of the role of the Golgi complex in 
cellular processes can be found in various treatises on cell biology. One exan^le is 
Alberts et al. '•Molecular Biology of the CeU" Garland PubUshing, Inc., which is 
incorporated herein by reference for all purposes. 

20 The Golgi complex can be an important indicator of the cellular effects caused by 

certain external agents. As a phenotypic characteristic, the Golgi complex has 
considerable value in drug discovery and fundamental biological research. For example, 
certain drugs and genetic modifications subtiy affect transport pathways within a oeU in 
ways that are manifest by the Golgi condition. Unfortunately, the value of the Golgi 

25 complex as an indicator has not been fully realized. This is because no simple consistent 
technique has been developed for characterizing the condition of Golgi complex in a high 
throu^put maimer. 



gT TMM ARY OF T fTP. INVENTION 

This invention addresses the above need by providing methods, code and apparatus 
that analyze cell images to automatically identify and characterize Golgi in individual 
cells. The invention accompHshes tiiis by furst locating the cells in the image and defining 
subregions of those cells that subsume some or all of tiie Golgi of liiose cells. The Golgi 

1 
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complex in the images typically have intensity values corresponding to the concentration 
of a Golgi component in the cell (e.g. a polysaccharide associated with the Golgi 
complex). The method/system then analyzes the Golgi components of the image to 
mathematically characterize the Golgi complex of individual cells. This mathematical 
5 characterization represents phenotypic information about the cells* Golgi complex and can 
be used to infer biological/physiological state of cells. From this information, mechanism 
of action and other important biological information can be deduced. 

One aspect of this invention pertains to a method of analyzing an image of one or 
more cells. This method may be characterized by the following sequence of operations 

10 (typically implemented on a computing device): (a) identifying a region of the image 
subsuming some or all of the Golgi complex of a smgle cell; (b) within this region, 
automatically identifying the location of the Golgi complex; and (c) automatically 
mathematically characterizing the Golgi conq)lex within the single cell. Preferably, the 
mathematical characterization is based upon (i) the Golgi complex location within the 

1 5 region and/or (ii) the concentration of a Golgi component within the region. 

In addition to a basic characterization of the Golgi complex, which typically 
involves some basic morphological or statistical characterization, the process may 
automatically classify the Golgi complex in a category that at least partially distinguishes 
between normal Golgi and Golgi that is either difiuse or disperse or both diflRise and 
20 disperse. Such classification may be accomplished using a biological model, in the form 
of a neural network or regression model (e.g. a CART), for example. 

Regarding the operation of identifying the region of the image subsuming some or 
all of the Golgi conq)lex, the process may "segment" the image into multiple regions, each 
subsuming at least part of an mdividual cell. In a one example, segmentation involves 
25 identifying locations of nuclei in the cells, and then dilating the locations of the nuclei to 
subsume the locations of some or all of the Golgi complex. 

Many different mathematical characterizations of the Golgi complex may be made. 
Preferably, all of them have biological relevance. Examples of types of mathematical 
characterizations include (i) an indicator of the peakedness of a histogram of at least one 

30 component of the Golgi complex, (ii) the texture of the Golgi complex and (iii) and the 
amount of Golgi complex in the region. As specific examples, the mathematical 
characterization of the Golgi complex include the kurtosis of intensity values obtained 
from the image, eigenvalues of a singular value decomposition of intensity values obtained 
from the image, and at least one of a mean and a standard deviation of intensity values 

3 5 obtained from the image. 
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Another aspect of tbis invention pertains to apparatus for automatically analyzing 
an image of one or more cells. Such apparatus may be characterized by the following 
features: (a) an interface configured to receive the image of one or more cells; (b) a 
memory for storing, at least temporarily, some or all of the image; and (c) one or more 

5 processors in communication with the memory and designed or configured to segment ttie 
image into discrete regions, each subsuming some or all of fhe Golgj covaplsK in single 
celL The processors may additionally characterize the Golgi conq)lex of single cells by 
operating on the discrete regions. Still further, the processors may be designed or 
configured to classify the Golgi complex based upon such characterization of the Golgi 

10 conq>lex. In one example, the classification distinguishes between normal Golgi and 
Golgi that is diffuse or disperse. The classification may make use of a biological model, in 
the form of a classification and regression tree, for example. 

Still another aspect of the invraition pertains to methods of producing a model for 
classifying cells based upon the condition of Golgi within the cells. Such me&od may be 

15 characterized by the following sequence: (a) receiving images of a pluraUty of cells of a 
training set; (b) analyzing the images to mathematically characterize the Golgi within the 
multiple cells from the training set; and (c) applying a modeling technique to the 
mathematical characterizations obtained m (b) to thereby produce the model. Typically, 
the framing set will contain individual cells having Golgi in various states. The various 

20 states of Golgi in the cells of the training set may be produced, at least in part, by treatment 
wifli multiple exogenous agents such as drugs or drug candidates. 

The process of analyzing the images ma&ematically may involve some of the 
operations outlined above such as segmentation, characterization, and classification. In 
one preferred approach, the modeling technique comprises generating a classification and 
25 regression tree. 

Yet another aspect of 1he invention pertains to computer program products 
including machine-readable media on which are stored program instructions for 
implementing a portion of or an entire method as described above. Any of tiie methods of 
this invention may be represented, in whole or in part, as program instructions tiiat can be 
30 provided on such computer readable media. In addition, the invention pertains to various 
combinations of data generated and/or used as described hereiiL 

These and other features and advantages of the present mvention will be described 
in more detail below with reference to the associated figures. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

Figures 1 A-ID depict, in cartoon fashion, cells having Golgi in normal, dispersion, 
difiRision, and dififixsion/dispersion states, respectively. 

Figure 2 is a block diagram of an architecture and data flow for using multiple 
5 image analysis algorithms, each handling a different image of a microscopic field. 

Figure 3 is a process flow diagram depicting one image analysis process for 
characterizing Golgi in accordance with this invention. 

Figure 4A generally illustrates how an image of multiple cells may be segmented to 
provide separate representations of individual cells and thereby allow a cell-by-cell 
10 analysis. 

Figure 4B generally depicts a representation of a cell as a binary object as 
identified using an edge finding routine. 

Figures 5 A and 5B depict a technique for "dilating" an image of a nucleus to create 
a perinuclear ring region where normal Golgi tend to localize. 

1 5 Figure 6 is an intensity histogram showing peaked and flat curves corresponding to 

normal and difRise Golgi, respectively. 

Figure 7 is a cartoon depiction of cellular Golgi showing how normal and dispersed 
Golgi segregate inside and outside the perinuclear region, respectively. 

Figure 8 is a sample classification and regression tree of the type fiiat may be 
20 employed to classify Golgi in accordance with this invention. 

Figure 9 is a block diagram of a computer system that may be used to inaplement 
various aspects of this invention such as the various image analysis algorithms of this 
invention, 

25 DETAILED DESCRPTION OF INVENTION 

A premise of this invention is that the shape and arrangement of Golgi in a cell 
provides valuable mfonnation about the cell's condition. If such condition results fi-om 
treatment with a drug or other exogenous agent, then the Golgi complex may shed li^t on 
a mechanism of action. 

4 
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In most normal cells, the Golgi complex is concentrated in a perinuclear region. In 
other words, normal Golgi is located in a cell's cytoplasm proximate to the nucleus. The 
chemical composition of the Golgi complex (and associated organelles and vacuoles) also 
afifects the state of the Golgi within a cell's cytoplasm. Thus, when the particular drug or 
5 other exogenous agent affects a cell's cytoskeletal components and/or Golgi chemistry, 
this mechanism of action can be identified or suggested by examining the cell's Golgi 
complex, 

Li accordance with this invention, image analysis of the Golgi complex may be 
used to characterize the efifects of any number of cellular stimuli. Stimuli of interest may 

10 include exposure to materials, radiation (including all manner of electromagnetic and 
particle radiation), forces (including mechanical, electrical, magnetic, and nuclear), fields, 
and the like. Examples of materials that may be used as stimuli include organic and 
iaorganic chemical compounds, biological materials such as nucleic acids, carbohydrates, 
proteins and peptides, lipids, various infectious agents, mixtures of the foregoing, and the 

15 like. Other specific examples of agents include exposure to different temperatures, non- 
ambient pressure, acoustic energy, electromagnetic radiation of all frequencies, the lack of 
a pardcxxlar material (e.g., the lack of oxygen as in ischemia), etc. 

Turning now to Figures lA, IB, IC, and ID, the Golgi complex may exist in four 
separate and identifiable end states. Other detectable end states may also exist. For 

20 purposes of illustration, however, this document will focus on the four states depicted in 
Figures lA - ID. Figure lA depicts a cell in an interphase stage of the cell cycle. Many 
cells have Golgi generally located as depicted, so long as they are not experiencing a major 
perturbation. As shown, a cell 101 includes a nucleus 103 and the Golgi ^paratus 105. In 
this **normal" cell, the Golgi complex lies close to the nucleus 103 primarily on one side of 

25 the nucleus. A region of the cell where the Golgi complex resides is also proximate to tiie 
microtubule organizing center (MTOC), a stmcture within the cells &om which the 
microtubules 107 radiate. Normal localization and morphology of the Golgi complex 105 
is mediated by, among other factors, microtubules 107 and microtubule motors, proteins 
that drive the motion of various cellular components along microtubules. 

30 Some changes in physiological conditions of the cell might cause firagmentation of 

the Golgi apparatus. Golgi in this condition is said to be "disperse." Figure IB depicts, in 
cartoon fashion, a cell 109 in which Golgi fragments 111 are dispersed througjiout the 
cytoplasm. This state of the Golgi complex may arise, for example, when the microtubule 
component of the cytoskeleton has been disrupted. Some drags, for example, cause 

35 depolymerization of tubulin. When this occurs, the Golgi complex no longer remains 
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confined to tight bands in the MTOC. Rather, it disperses tiiroughout the cytoplasm as 
depicted in Figure IB. Other exogenous agents may disrupt microtubule motors. This too 
can cause dispersion of the Golgi complex. In general, exogenous agents that disrupt the 
microtubule cytoskeleton, either directly or indirectly, lead to dispersion of the Golgi 
5 complex. 

Figure IC depicts a third «id state of the Golgi organelle. As shown there in 
cartoon fashion, a cell 1 13 has had its Golgi 115 spread throughout the cytoplasm without 
fragmentatioiL Here again the Golgi complex no longer remains confined to the 
perinuclear area. However, rather flian forming discrete fragments it diflRises more or less 
10 evenly throughout the cells cytoplasm. Golgi in this condition is said to be in a "difiuse" 
state. 

Golgi diffusion typically results when the dynamic barrier between the Golgi 
complex and endoplasmic reticulum compartments is disrupted to some degree. Actually, 
at this point, there may be no distinct Golgi. Rather the endoplasmic reticulum and Golgi 
15 have become one entity. This condition may result Scorn a direct or indirect effect on the 
pathways between the endoplasmic reticulum and the Golgi complex. Any of a number of 
different exogenous and endogenous influences may cause ttiis. In the end, the chemistry 
of the Golgi membrane changes significantly from its normal state. 

The various states depicted in Figures lA-lC represent "end states" in which the 
20 Golgi complex has reached an extreme. Obviously, depending upon the level of an effect 
causing cell disruption, the Golgi complex may exist in any one of these states only to a 
Innited degree. Thus, for example, a cell may exist may exist in a state between difiusion 
and dispersion as depicted in Figure ID. As shown there, a cell 117 mcludes both difiuse 
Golgi components 115 and disperse Golgi components 111. In preferred embodiments, 
25 this invention can determme the degree to which a particular cell resides in any one of the 
4 states depicted in Figures 1 A- ID. 

Note also that the some of the influences may affect absolute amounts of Golgi 
components interacting with a particular Golgi marker. Some embodiments of this 
invention characterize the cell based upon the concentration of a marked Golgi component, 
30 which is affected by the degree to which an amount of a chemical constituent (or 
constituents) of the Golgi complex has changed from its normal state. 

Note that the normal state depicted in Figure 1 A is normal only for interphase cells, 
not for mitotic cells. Golgi naturally becomes dispersed and/or diffuse during mitosis. 
When a large number of dividmg cells are present, these effects can obscure detection of 
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interesting changes to the Golgi complex that one might wish to detect Therefore, the 
present invention may include operations that distinguish between mitotic and interphase 
cells, hi the subsequent discussion, much of the characterization of the Golgi complex 
assumes that an algorithm has aheady Umited consideration to interphase cells, 

5 

Preparation of the hnage 

Generally the images used as the starting point for the methods of fliis invention are 
obtained from cells that have been specially treated and/or imaged under conditions that 
contrast the cell's Golgi fiom other cellular components and the background of the unage. 

10 hi the preferred embodiment, the cells are fixed and then treated with a labeling agent (i.e., 
a marker) that binds to one or more Golgi components and shows up in an image. 
Preferably, the chosen agent specifically bmds to cell components that are enriched in the 
Golgi complex. The agent should provide a strong signal to show where it is concratrated 
in a given cell. To this end, the ^ent may be luminescent, radioactive, fluorescent, etc. 

1 5 Various stains and fluorescent compounds may serve this purpose. 

The Golgi complex has numerous components that can be marked for generating 
images for use with this mventioiL For example, the Golgi contains specific protems, 
Upids, and polysaccharides that can be marked for analysis. Some of ttiese components 
occur in much higher concentration withm the Golgi complex than wifhm other cellular 

20 components, hi one preferred embodunent, the Golgi complex is identified using 
fluorescently labeled antibodies that bmd specifically to a Golgi component. In a 
particularly preferred embodunent, the Golgi complex is identified by treatment with 
labeled Lens cuhnaiis lectin (LC lectin). Lectins generally bind very specifically to certam 
polysaccharides. LC lectin bmds to polysaccharide chams on proteins (jprotoglycans) 

25 having a N-acetyl-d-glucoseamine residue at the end of the polysaccharide chain. 
Alternatively one could use antibodies to proteins enriched in the Golgi complex, such as 
gpl30, [betajCOP. 

Various techniques for preparing and unagmg appropriately treated cells are 
described in U.S. Patent AppUcations 09/310,879, 09/311,996, and 09/311,890, previously 
30 incorporated by reference, hi the case of cells marked with a fluorescent material, a 
collection of such cells is illuminated with Ught at an excitation firequency. A detector is 
tuned to collect light at an emission firequency. The collected hght is used to generate the 
image and highlights regions of high Golgi component concentration. 
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Sometime corrections must be made to the measured intensity. This is because the 
absolute magnitude of intensity can vary from image to image due to non-linearities in the 
image acquisition procedure and/or apparatus. Specific optical aberrations can be 
introduced by various image collection components such as lenses, filters, beam splitters, 
5 polarizers, etc. Other non-linearities may be introduced by an excitation li^t source, a 
broad bmd light source for optical microscopy, a detector's detection characteristics, etc. 
For example, some optical elements do not provide a "flat field" As a result, pixels near 
the center of the image have their intensities exaggerated in conqjarison to pixels at the 
edges of the image. A correction algorithm may be applied to compensate for this effect. 
10 Such algorithms can be easily developed for particular optical systems and parameter sets 
employed using those imaging systems. One simply needs to know the response of the 
systems under a given set of acquisition parameters. 

General Algorithms for Image Analysis 

Various algorithms can be used to analyze cell images that highlight Golgi. 

15 Certain specific examples will be described herein. In general, much of the relevant 
information associated with Golgi can be obtained by specifically analyzing interphase 
cells and ignoring, or treating separately, mitotic cells. Preferably, algorithms of this 
mvention identify the normal, diffuse, and disperse states of Golgi vdthin interphase cells. 
To this end, the algorithm may obtain certain morphological and/or statistical information 

20 about the Golgi complex from the image and use that information to classify the Golgi. 

Most images of relevance will depict the Golgi complex as variations in mtensity 
over position in the image, with higher intensity regions in the image corresponding to 
regions of the cell where the Golgi component exists in relatively high concentrations. 
Examples of morphological and statistical hnage characteristics that can be used to 
25 effectively characterize the Golgi include the "peakedness" of a histogram of pixel 
intensities, the texture of the Golgi regions of the image, the overall or total intensity 
obtained in the image, and the moments of the Golgi complex about a nucleus identified in 
a cell image. 

Often it will be desirable to use an image analysis algorithm for Golgi in 
30 conjunction with image analysis algorithms for other components of a cell. In fact, in a 
preferred algorithm described herein, it is necessary to first identify the nucleus of a cell 
prior to identifying the Golgi complex. 

Figure 2 depicts, from a global perspective, an algorithm (or group of algorithms) 
that characterizes a cell in terms of its nuclei, Golgi, and tubulin. As shown in Figure 2, a 

8 
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global image analysis 200 begins with one or more images 201 as input. In this example, 
image 201 possesses three separate "channels." Each of these channels represents a 
different label that can be separately captured by the imaging apparatus and represented 
distinctly in an image of the cell or a field of ceUs. In a typical example, the separate 
5 labels are separate fluorophores having distinct emission frequencies. In a very specific 
example, ceUs are treated with a marker for DNA emitting in blue, a marker for the Golgi 
complex emitting in green, and a marker for tubulin emitting in red. As depicted in Figure 
2, the image analysis algorithm 200 includes separate process branches: a nuclei analysis 
branch 203, a Golgi analysis branch 205, a tubulin analysis branch 207. 

10 Each of these branches has its own separate algorithm for analyzing the image. As 

shown, the nuclei are analyzed with an algorithm 209, the Golgi complex are analyzed 
with an algorithm 21 1, and the tubulin is analyzed with a algoritimi 213. Each of these 
algorithms act on information taken fix)m the channel specific to the cell component of 
interest. The output of each of these algorithms is flie combination of extracted features 

15 and, possibly (not necesserily), a separate classification or characterization of the 
associated ceU component according to its state in a particular cell. As depicted, flie 
output of algorithm 209 is the combination of identified individual nuclei (this is the part 
of the output that is later used by the Golgi algorithm), their morphological and intensity 
features, and a classification of the cell based upon its DNA or nucleus features. In a 

20 preferred embodiment, block 215 classifies the ceU based upon its stage in the cell cycle. 
The output of algorithm 21 1 are Golgi features which are input to 217 for classification. 
Preferably, this classification is based upon (1) the type of Golgi arrangement (e.g. the 
degree of diffusion, dispersion, and normalcy) and/or (2) the concentration of one or more 
Golgi componaits (depicted as intensity of signal emitted by one or more Golgi markers). 

25 Finally, flie output of tubulin algorithm 213 is the tubulin features which are iiq)ut to 219 
for classification. In one embodiment, this classification characterizes the overall shape of 
a cell. 

Figure 3 depicts a process flow that may be employed to characterize Golgi 
appearing in images of cells. The process depicted in Figure 3 (identified by the reference 
30 300) may serve as a combination of algorithms 209 and 211 depicted in Figure 2, for 
example. 

Initially, the individual cells, or at least the regions of those cells harboring the 
Golgi complex, are identified. GeneraUy, this process is referred to as "segmentation." 
This can be accomplished in numerous ways. In the embodiment depicted in Figure 3, it is 
35 accompUshed in operations 301 and 303. In this example, the image analysis tool initially 
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identifies the nucleus of each cell captured in the image under consideration. Since images 
from dififeent channels can be well registrated, nucleus can be first identified in the DNA 
channel and then overlaid to the image under consideration. Identifying nucleus in the 
DNA channels may be accomplished using any of a number of nucleus 
5 identification/segmentation algorithms. As shown in Figure 2, the output of nucleus 
algorithm 209 is provided to Golgi algorithm 211. 

After the nuclei have been identified at 301, the Golgi image analysis routine next 
defines a "ring region" around each nucleus. See 303. Generally, this step serves to define 
the perinuclear region. It will be described in detail with reference to Figures 4 A and 4B. 
10 For now, it sufficient to understand that the purpose of operation 303 is to define a region 
that will encompass or subsume some or all of the Golgi complex in a normal interphase 
cell. 

Within the regions defined in operation 303, the features of the Golgi complex may 
be identified using intensity versus position data. For example, each pixel in the ring 

15 regionis associated with a particular intensity, at the channel in which the Golgi complex is 
imaged. From such intensity data, varioxis morphological and/or statistical features may be 
extracted in order to characterize the Golgi complex. See 305. These features will be 
described in more detail below. In one particularly interesting embodiment, the features of 
interest include kurtosis, standard deviation, mean, and singular value decomposition, all 

20 obtained firom the intensities of the pixels in the ring region. 

The statistical and morphological features can provide very usefiil information for 
characterizing Golgi complex. In themselves, however, these parameters have no 
biological meaning. In a preferred embodiment, algorithm 300 converts information 
contained in these parameters to biologically relevant classifications. This operation is 
25 depicted in block 307. In a preferred embodiment, operation 307 classifies all cells into 
one of four primary Golgi states: normal, diffuse, disperse, and dififixse/disperse. 

Furflier biologically relevant information can be obtained by considmng an entire 
population of cells, and their associated Golgi states. Thus, algorithm 300 concludes with 
an operation 309, in which a population analysis is conducted. In one example, the 
30 relative percentages of cells in any of the stages of cell cycle having Golgi in each of the 
four above-mentioned end states is computed fix>m data on Golgi status and cell cycle 
stage of each individual cell. Note that statistics of the Golgi status may be computed 
independently for interphase and mitotic cells. 
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Segmentation 

One approach to segmentation is depicted in Figure 4A. As shown tiiCTe, an image 
401 includes a pluraUty of cell nuclei images 403 identified, for example, with a DNA- 
binding marker. Segmentation effectively converts image 401 into discrete 
5 images/representations for the DNA of each cell as shown at 405. M a preferred 
embodiment, this collection of representation 405 is provided as a mask providing 
intensity as a function of position for each cell nucleus in image 401. 

Individual cell representations 405 may be extracted torn image 401 by various 
image analysis procedures. Preferred approaches include edge finding routines and 

10 threshold routines. Some edge finding algorithms identify pixels at locations where 
intensity is varying r^idly. For llireshold routines, pixels contained within fhe edges will 
have a higher intensity than pixels outside the edges. Threshold algorithms convert all 
pixels below a particular intensity value to zero intensity in an image suteegjon (or the 
entire image, depending upon the specific algoriton). The threshold vahie is chosen to 

15 discriminate between nuclei images and background. All pixels with intensity values 
above threshold in a given neighborhood are deemed to belong to a particular cell nucleus. 

The concepts underlying fluesholding are well known. A threshold value is chosen 
to extract those features of flie image having intensity values deemed to correspond to 
actual cells (nuclei). Typically an image will contain various peaks, each having 
20 collections of pixels with intensity values above the threshold. Each of the peaks is 
deemed to be a separate "cell" or "nucleus" for extraction during segmentation. 

An appropriate threshold may be calculated by various techniques. Li a specific 
embodiment, the threshold value is chosen as the mode (highest value) of a contrast 
histogram. In this technique, a contrast is computed for every pixel in the image. The 

25 contrast may be computed as the intensity difference between a pixel and its neighbors. 
Next, for each intensity value (0-255 in an eight byte image), the average contrast over all 
pixels in the image is computed. The contrast histogram provides average contrast as a 
function of intensity. The threshold is chosen as the intensity value having the largest 
contrast. See "The hnage Processing Handbook," Third Edition, John C. Russ 1999 CRC 

30 Press LLC IEEE Press, and "A Survey of Thresholding Techniques," P.K. Sahoo, S. 
Soltani and A.K.C. Wong, Computer Vision, Grq)hics, and Image Processing 41, 233-260 
(1988), both of which are incorporated herein by reference for all purposes. 

In another specific embodiment, edge detection may involve convolving images 
with the Laplacian of a Guassian filter. The operation performed on the image is given by 
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Here I is the intensity of a pixel, x and y are the coordinates in the original image, 

f{x, y) = j\l{x -u,y- v)g{u, v)dudv 

and g is the Lq)lacian of a Guassian filter ^ e ^ . The zero-crossings of the 

resulting image fix.y) are detected as edge points. The edge points are linked to form 
closed contours, thereby segmenting the relevant image objects. See The Image 
Processing Handbook, referenced above. Figure 4B depicts the transformation in a very 
course feshion. An original image 422 is convolved with the Laplacian of a Guassian filter 
to give a new image 424 which contains positive and negative values at tiie component 
pixels. Note tiiat tiiis figure shows the mask of an object, not detected edges/contours. 
Wherever the sign changes in moving &om one pixel to the next a contour results. 

Generation Of Ring Region 

As indicated in the discussion of 3, it is often desirable to generate a ring about tiie 
ceU nucleus (see block 303) in order to define a region where normal Golgi are likely to 
reside. This operation is depicted in cartoon fashion in Figure 5A. As shown there, an 
image of a ceU's nucleus 503 is bounded by a ring region 505. Within ring region 505, a 
normal Golgi organeUe 507 appears in the image. As explained above, normal Golgi 
typically reside in the perinuclear region, which is coestensive with ring region 505. 

Note that the size of the ring region can be chosen to allow some or all of the 
normal Golgi to be subsumed. In a preferred embodiment, the size of ring region 505 is 
chosen to subsume all of the normal Golgi in a majority of intaphase cells 507. DifEuse 
and dispersed Golgi typically are not confined to a perinuclear region. Therefore, ring 
region 505 may not, depending upon the ring width setting, subsume all the Golgi in these 
states. The algorithm can use this fact to distinguish, or help distinguish, normal Golgi 
ftom difiuse and dispersed Golgi. Assuming tiiat the total "quantity of Golgi" is 
approximately consistent across normal, diffuse, and disperse states, tiien the "amount of 
Golgi" located within ring region 505 will be greatest in the case of flie normal Golgi state. 
Because some significant fraction of the Golgi Ues outside of ring region 505 in the case of 
difiuse and/or disperse Golgi states, tiie total amount of Golgi detected within region 505 
will be less in these states. In preferred embodiments, tiiis is not tiie primary metiiod used 
to deduce tiie Golgi state. Otiier features of pixel distributions (described below) are used 
more typically. 

The ring shaped region about tiie nucleus may be defined using various techniques. 
One suitable technique is depicted in Figure 5B. As shown fliere, an image segment 511 
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comprises a two-dimensional collection of locations or pixels. Each of these is 
represented by a small rectangular box. The pixels where the nucleus resides are shown 
with the reference letter '"N." To define the ring region, the algorithm locates the nearest 
neighbor pixels about the edge of the nucleus. In one embodiment, this is accomplished 
5 by identifying the four nearest neighbor pixels for each pixel identified as belonging to the 
nucleus. Those neighboring pixels which themselves form part of the nucleus are not 
considered. In Figure 5B, these nearest neighbor pixels are identified by the reference 
letter **X." To provide the appropriate width for the ring region, this process may be 
repeated any niraiber of times. In the example shown in Figure 5B, it is repeated once-to 
10 provide a total of two iterations. Jn the second iteration, each of the pixels considered in 
the first iteration is used to identify four nearest neighbors. In Figure 5B, the second 
iteration nearest neighbors are identified by the reference letter "O." 

The exact number of iterations used to define the ring region can vary depending 
upon the number of parameters. First, one must decide whether the ring region should 
15 subsume only Golgi residing in the close perinuclear region or should subsume Golgi 
residing within a wider portion of the cell, or even the entire cell itself. Second, the 
desired width of the ring region may vary depending upon the magnification of the image, 
the resolution of the image, and other related properties of the image. 

Note that the mechanism depicted in Figure 5B represents but a single ^proach to 

20 defining the ring region. Some other suitable techniques will be readily apparent to those 
of skill in the art. Further, other techniques may be employed to define the region within 
which the Golgi may be subsumed. For example, some techniques will not consider the 
DNA or nucleus of a cell. Rather, they will simply focus on the image of the pertinent 
Golgi component and then consider appropriate morphological and/or statistical 

25 parameters to distinguish the Golgi of one cell firom the Golgi of another cell. Still fiirther, 
other techniques will consider other non-Golgi features of an image to distinguish one cell 
firom another. For example, some algorithms may distinguish cells based upon images of 
the plasma membrane, cytoskeleton, etc. U.S. Patent Application No, 09/792,013 (Atty. 
Docket No. CYTOP013), previously incorporated by reference, describes a technique that 

30 uses tubuhn (or other cytoskeletal component) and DNA markers together to identify 
discrete cells. Regardless of which technique is employed, in the end, regions subsimiing 
some or all of the Golgi within discrete cells must be separately identified. Then, the 
image of the appropriate Golgi marker can be analyzed to characterize the Golgi in 
accordance with this invention. Note that while separate analysis of Golgi regions in each 

35 individual cell is often preferable, this is not a necessary requirement of the algorithm. 
Upon creation of "ring" areas, all pixels in all of the rings can be analyzed simultaneously. 
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Relevant Golei Features 

In order to effectively characterize the Golgi complex of individual cells, the 
present invention may make use of various parameters derived from Golgi complex 

5 images. These parameters may represeat the shape, size, concentration, texture, and 
amount of Golgi components in an individual ceU. In one preferred algorithm of the 
present invention, one set of parameters is used to characterize the "type" of Golgi and 
another parameter or set of parameters is used to characterize the amount of Golgi 
complex in an individual cell. The types of Golgi may include normal, diffuse, disperse, 

10 and disperse/diffuse as discussed above. Of course, other suitable processes may consider 
different and/or additional types of Golgi. The amount of Golgi may be determined from 
tiie local concentrations of a marked Golgi component summed over the entire region of 
interest (e.g. a ring region). 

Note that while die discussion below is organized sequentially, the described 

15 features of different nature may be used simultaneously to classify Golgi states. The 
parameters chosen for use with this invention should allow discrimination between 
biologically relevant classifications. Further, they may provide a continuous measure of 
the degree to which an individual cell exhibits features of any of the classifications. As 
mentioned, particularly interesting parameters wiU relate to one or more of flie foUowing 

20 general features of the Golgi complex image: the shape (such as peakedness) of an 
intensity (marker concentration) histogram, the texture of the Golgi areas, the overall 
intensity (some of local marker concentrations) of the Golgi image, and the moment 
(including moments of different orders) of the Golgi complex about the nucleus. Another 
set of features for the Golgi complex are statistics computed from a histogram of the 

25 intensity of pixel intensities by angle from tiie center of the nuclei. The x-axis of this 
histogram is the angle of tiie vector connecting the center of the nuclei to the ring pixel. 
The y-axis is tiie sum of tiie pixel intensities at a given angle. Whrai tiie histogram is 
unimodal tiie Golgi complex is likely classified as normal, if tiie histogram has many 
modes the Golgi complex is likely classified as dispersed and if tiie histogram is unifonn 

30 the Golgi complex is hkely classified as diffuse. 

Regarding peakedness, Figure 6 depicts sample histograms for normal and diffuse 
Golgi. As shown in Figure 6, a histogram plots tiie pixel count (number of pixels in an 
image meeting a criteria) as a fimction of pixel intensity. Remember that flie intensify is a 
measure of tiie local concentration of a marked component in tiie Golgi conq)lex. A Golgi 
35 complex wifli hi^ local concentrations of flie marked Golgi component wUl have a 
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narrow intensity distribution as shown by curve 601. A difiEuse Golgi complex will have a 
vnder distribution as illustrated by curve 603. 

Generally, normal Golgi, and to some degree dispersed Golgi, possess local regions 
of high concentration of marked Golgi component. Therefore, these types of Golgi wiU 
5 have histograms with relatively narrow distributions. M contrast, diffuse Golgi complexes 
have their marked components relatively evenly distributed throughout most or all of the 
ring area. Therefore, the histogram of such difiiise Golgi will have a relatively wide 
distribution. In Figure 6, curve 601 might represent the histogram of normal or dispersed 
Golgi, while curve 603 might represent a histogram of difiiise Golgi. 

10 Another parameter of interest in characterizing Golgi is the texture. The texture 

may be characterized by a number of paraiheters such as the singula^ value decomposition 
of a matrix of intensity values representing the marked Golgi component. Generally the 
texture of an object relates to the size and shape of the granules or other components of an 
image. Various textm« related features can help to discriminate between normal and 

15 dispersed Golgi and/or between dispersed and diffuse/dispersed states. 

The overaU intensity associated with an image of Ihe Golgi complex can represent 
the total amount of particular Golgi component(s) within a given cell or region of a cell. 
Figure 7 illustrates how this concept can be used to distinguish between normal Golgi and 
dispersed Golgi using an algorithm that employs a ring region as described above. As 

20 shown in Figure 7, a cell 701 possesses normal Golgi while a ceU 703 possesses dispersed 
Golgi. Each cell includes a nucleus 705 and a plasma membrane 707. In the case of cell 
701, the Golgi complex (represented by reference number 709) resides entirely within a 
ring region 711. In this ceU, all of its Golgi resides within ring region 711. Therefore, the 
total intensity of a Golgi maricer found within ring region 711 will represent all of the 

25 Golgi in cell 701. In contrast, cell 703 has a comparable ring region 711' which subsumes 
only a fraction of the Golgi in that cell (709'). The remainder of the dispersed Golgi Ues 
outside the ring region 71 1'. As a consequence, it may be expected that the total intensity 
of Golgi calculated for cell 703 (using ring region 711') will be less than the total intensity 
calculated for cell 701. 

30 The biologically relevant features of the Golgi complex may be obtained from 

various mathematical parameters. Among the parameters of interest are the following: the 
total area of the ring region in which the Golgi marker is considered, the mean intensity of 
aU pixels in the ring region, the standard deviation of the pixel intensities in ring region, 
the kurtosis of the pixel intensities in the ring region, and the second, third, and fourth 

35 largest eigenvalues obtained by a singular value decomposition of a matrix of pixels in the 
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ring. Also, sometimes, the Golgi "type" (noimal, diffuse, dispersed, or diffuse and 
dispersed) is treated as a parameter. This is technically a characterization derived from 
parameters such as the SVP eigenvalues, the kurtosis, etc. In a particularly preferred 
embodiment, flie following four relevant parameters are used to characterize the Golgi 
complex: mean, standard deviation, kurtosis, and singular value decomposition. 

Singular value decomposition and kurtosis can be derived from digitized images 
using conventional ^proaches. Kurtosis is given by nWCstd)*. where std is the standard 
deviation of intensity of the pixels in the ring region in the image and nu is given by the 
following expression: 



^4 =^I!!j^<-^' 



In this expression, N is the number of pixels in the ring region in the image, xi is the pixel 
intensity, and 3c is the mean pixel intensity. The kurtosis is particularly useful in 
distinguishing diffuse Golgi from normal or dispersed Golgi. Particularly, higher values of 
kurtosis suggest a higher degree of the Golgi "diffusion." 

15 The singular value decomposition is technique that opaales on a matrix of pixel 

intensity values arranged in their relative spatial positions as in the image. Preferably, 
Ihese pixel intensities are obtained from a ring region as described above. The relevant 
values are obtained by multiplying the matrix with its transpose and obtaining tiie 
eigenvalues. The singular value decomposition provides information about the texture of 

20 the Golgi complex in the image. Thus, it is useful in distinguishing different states of the 
Golgi con^lex. 

One aspect of tiie present invention is the ability to estimate the amount of Golgi 
(or a component of the Golgi) in a given cell or region of a ceU. This aUows one to 
characterize cells based upon relative or absolute quantity of a particular Golgi component 

25 within some region of the cell such as tiie perinuclear region. In general, an agent applied 
to highUght Golgi should emit a signal that is proportional to the amount of agent that has 
bound to a Golgi component. Thus, the amount of signal (usually indicated by the signal 
intensity summed or integrated over the entire region of interest) provides a direct 
indication of the quantity of a Golgi component present When this is the case, the present 

30 invention allows one to obtain an accurate measure of the amount of Golgi in a given ceU 
or cell region. 
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Techniques for estimating the quantity of DNA in a ceU nucleus are described US 
Patent AppUcation No. 09/729,754, previously incorporated by reference. These 
techniques may involve subtracting background signal and other image processing 
procedures. In general, such techniques apply to estimating the quantity of Golgi in a cell 
or region of a cell. 



riflssification of Golgi 

Optionally, the algorithms of this invention may specifically classify the Golgi 
complex into one or more biologicaUy relevant classification based upon parameters such 

10 as those described above. As mentioned previously, one example of biologicaUy relevant 
classifications include the four classifications depicted in Figures 1 A- ID. In the context 
of the general process flow depicted in Figure 3, the biologically relevant classification 
takes place at block 307. This operation takes as its inputs morphological, textual, and/or 
statistical parameters and provides as an output the biological relevant classifications. It 

15 accompUshes this using an appropriate model provided in the form of a neural network, 
linear or non-linear mathematical expression, a tree or graph, and the like. As previously, 
mentioned, one preferred approach employs a classification and regression tree. 

Classification and regression trees (CART) are well known tools for classifying 
objects. They are described in Brieman et al., (1984) Classification and Regression Trees. 

20 Monterey. Wadswiith and Brooks/Cole. Figure 8 depicts one example of a classification 
and regression tree (CART) 800. In this example, there are three input parameters and 
four possible classifications. The input parameters are identified as, fi, fz, and fs. The 
classifications are denoted C,, C2., Cj.and C4. In this hypothetical example, the tree 
initially considers the input parameter fs. If the value of {3 is less than 0.1, the tree 

25 branches in one direction. If, on the other hand, the value of fj is greater than or equal to 
0.1, the tree branches in the opposite direction. Assuming that the value of h is less than 
0.1, the tree next requires that the parameter fz be considered. In Has example, if the value 
of f2 is less than 100, the model classifies the Golgi complex in Ci. On the other hand, if 
the value of fj is greater than or equal to 100 the model classifies the Golgi complex in C2. 

30 Similarly, if the value of fj is found to be greater than or equal to 0.1, then the value of the 
input parameter fi is considered. Should the value of fi be less than 3, the tree classifies 
the Golgi complex in C3. Finally, if the value of f, is found to be greater than or equal to 
3, then the model classifies the Golgi complex in C4. 
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Obviously, the parameters ft, f2, and fa will be relevant to classification of Golgi, in 
accordance with this invention. For example, f\ might correspond to the mean pixel 
intensity of the Golgi channel, £2 might correspond to the standard deviation of the pixel 
intensity values of the Golgi channel, and fa might correspond to the second, third, and/or 
5 foxuA singular value decomposition of a matrix of pixel intensity values in the ring region. 
The classification C\ through C4 might correspond to the Golgi complex states depicted in 
Figures 1 A through ID. 

Qassification models may be generated using numerous techniques. These include 
regression analyses (e.g., techniques for generating CARTs), neural networks, ma ximum 
likelihood/mixture models, etc. In general, any model requires a valid training set 
containing numerous samples that span the range of likely cases that will be encountered 
in practice. The saniples should span a wide range of classification types, mcluding types 
that vary in degree between extreme end cases (e.g., the difiRise/disperse Golgi state). In 
addition, the samples should include a wide range of input parameters. Each sample will 
have an ascribed classification and clearly defined input parameters. In this case, it will 
typically be necessary for an experienced scientist to classify images based upon their 
Golgi state. These classifications together with the relevant parameters (e.g. mean, 
standard deviation, kurtosis, and singular value decomposition) are provided to a tool for 
generating the ^>propriate classification model. 

20 To maximize the predictive accuracy of the model, it may be appropriate to 

gmerate separate models for specific ranges of conditions. Typically, a separate model 
will be generated for each different cell line to be considered, because the appearance of 
the Golgi complex varies widely fi-om cell line to cell line. Further, settings associated 
with the image analysis ^paratus may cause significant variations in the image analysis. 

25 Therefore, separate models may be appropriate for different image analysis settings such as 
magnification, illumination intensity and the like. As noted, the Golgi complex may be 
imaged using various components contained within the Golgi complex. For example, 
some components may be detected with labeled lectins and other markers may be detected 
witih labeled antibodies. Each of these separate component-marker combinations may 

30 deserve its own model, assuming more than one marker is used for the Golgi complex. 

As indicated above, certain embodiments of the invention do not necessarily 
employ a classification operation. In these embodiments, the algorithm simply emploj^ 
the parameters generated a block 305 in order to draw conclusions about a population of 
cells. Or alternatively the analysis tool may generate coefi&cients for biologically relevant 
35 features such as diflRision. In one example, the kurtosis of the Golgi image (or some 
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derivation from kurtosis) serv^ as a coefficient for diffiision. Further, the system could 
output both the biologically relevant classification of end states and the particular 
parameters used to generate these classifications. 

Analysis of Populations of Cells 

5 As mentioned, the most useful biological information often comes from im^es of 

a population of cells. The population may comprise a very limited range of variations, 
such as a single cell treated with a single drug at a single concentration. Or it may 
comprise a diverse set of variations such as multiple cell lines, each treated with multiple 
concentrations of a drug (or even multiple drugs believed to operate via a single 
10 mechanism of action). 

Note that many images used with the present invention will contain many discrete 
cells. The images often show several hundreds of cells, although the actual number 
depends on the application. Such cells may collectively comprise the population of 
interest. Or multiple wells or a subset of a single well may comprise the population. 

15 Regardless of how the population is defined, it should preferably contain a 

relevant sample of cells exposed to a condition of interest From this popiilation, 
conclusions about a given stimulus* effect on the Golgi complex can be drawn. The Golgi 
complex in the population can be characterized by the percent of cells in each primary 
category of Golgi end state or by a distribution of Golgi specific parameters across the 

20 population or by some other measure of Golgi parameters and/or classes across the 
population. As mentioned, interphase cells typically hold the most relevant information 
about an effect of a stimulus on the Golgi complex. This is because unperturbed 
interphase cells typically have normal Golgi of the type depicted in Figure 1 A. Deviations 
from this normal state, such as illustrated in Figures IB, IC, and ID, provide clues about 

25 the effect of a stimulus. Mitotic cells, in contrast, do not exhibit the **normal" Golgi state 
even in the absence of an additional stimulus and instead exhibit the diffuse and/or 
disperse state. Therefore, it is usefiil to characterize cells based on their position in the cell 
cycle - at least as either mitotic or interphase. 

As explained m U.S. Patent Apphcation No. 09/729,754, previously incorporated 
30 by reference, an image of a cells nucleus (DNA) can be analyzed to distinguish between 
the Gi, S, and G2, and M phases of a cell. Briefly, the total amount of DNA in a cell 
nucleus can indicate whether an interphase cell is in the Gi, S, or G2 phase. Another 
parameter or grouping of parameters can discriminate between mitotic and interphase 
cells. One example of such parameter is the variance in intensity exhibited by an indicator 
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of DNA concentration. One example of a group of parameters is average pixel intensity 
and area of the nucleus. In this second example, the average pixel intensity is the average 
of the pixel-based intensities of the nucleus and "area" is the total area of the nucleus. 

In a particularly preferred embodiment, the cells of a population are first 
5 characterized using marked DNA. This classification at least distinguishes between 
mitotic and interphase cells. Then, the same cells are characterized using marked Golgi, 
Particular attention is paid to the Golgi complex of the interphase cells. 

Software/Hardware 

Generally, embodiments of the present invention employ various processes 
10 involving data stored in or transferred througih one or more computer systans. 
Embodiments of the present invention also relate to an q)paratus for performing these 
operations. This apparatus may be specially constructed for the required purposes, or it 
may be a general-purpose computer selectively activated or reconfigured by a computer 
program and/or data structure stored in the computer. The processes presented herein are 
15 not inherently related to any particular computer or other q)paratus. In particular, various 
genaral-purpose machines may be used with programs written in accordance wifli the 
teachings herein, or it may be more convenient to construct a more specialized apparatus to 
perform the required method steps. A particular structure for a variety of these machines 
will appear fi'om the description given below. 

20 In addition, embodiments of the present invention relate to computer readable 

media or computer program products that include program instructions and/or data 
(including data structures) for performing various compute-implemented operations. 
Examples of computer-readable media include, but are not limited to, magnetic media 
such as hard disks, floppy disks, and magnetic tape; optical media such as CD-ROM disks; 

25 magneto-optical media; semiconductor memory devices, and hardware devices that are 
specially configured to store and perform program instructions, such as read-only memory 
devices (ROM) and random access memory (RAM). The data and program instructions of 
this invention may also be embodied on a carrier wave or other transport medium. 
Examples of program instmctions include both machine code, such as produced by a 

30 compiler, and files containing higher level code that may be executed by the computer 
using an interpreter. 

Figure 9 illustrates a typical computer system that, when appropriately configured 
or designed, can serve as an image analysis apparatus of this invention. The computer 
system 900 includes any number of processors 902 (also referred to as central processing 
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units, or CPUs) that are coupled to storage devices including primary storage 906 
(typically a random access memory, or RAM), primary storage 904 (typically a read only 
memory, or ROM). CPU 902 may be of various types including microcontrollers and 
microprocessors such as programmable devices (e.g., CPIDs and FPGAs) and 

5 unprogrammable devices such as gate array ASICs or general purpose microprocessors. 
As is well known in the art, primary storage 904 acts to transfer data and instructions uni- 
directionally to the CPU and primary storage 906 is used typically to transfer data and 
instructions in a bi-directional manner. Both of these primary storage devices may include 
any suitable conq)uter-readable media such as those described above. A mass storage 

10 device 908 is also coupled bi-directionally to CPU 902 and provides additional data 
storage capacity and may include any of the computer-readable media described above. 
Mass storage device 908 may be used to store programs, data and the like and is typically a 
secondary storage medium such as a hard disk. It wiU be appreciated that the information 
retained within the mass storage device 908, may, in appropriate cases, be incorporated in 

15 standard fashion as part of primary storage 906 as virtual memory. A specific mass 
storage device such as a CD-ROM 914 may also pass data uni-directionally to the CPU. 

CPU 902 is also coupled to an interfece 910 that connects to one or more 
iiqjut/output devices such as such as video monitors, track balls, mice, keyboards, 
microphones, touch-sensitive displays, transducer card readers, magnetic or paper t^e 

20 readers, tablets, styluses, voice or handwriting recognizers, or other well-known input 
devices such as, of course, other con^uters. Finally, CPU 902 optionally may be coupled 
to an external device such as a database or a computer or telecommunications network 
using an external connection as shown generally at 912. With such a connection, it is 
contemplated that the CPU might receive information from the networic, or might output 

25 information to the network in the course of performing the method steps described herem. 

In one embodiment, the computer system 900 is directly coupled to an image 
acquisition system such as an optical imaging system that captures images of ceUs. Digital 
images from the hnage generating system are provided via intaface 912 for image analysis 
by system 900. Alternatively, the images processed by system 900 are provided from an 

30 image storage source such as a database or other repository of ceU images. Again, the 
images are provided via interface 912. Once in the image analysis apparatus 900, a 
memory device such as primary storage 906 or mass storage 908 buffers or stores, at least 
temporarily, digital images of the cell. Typically, the cell images will show locations 
where Golgi, and possibly DNA, exists within the cells. In these images, local values of a 

35 Golgi image parameter (e.g., radiation intensity) correspond to amounts of a Golgi 
component at the locations within the cell shown on the image. With this data, the image 
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analysis apparatus 900 can perform various image analysis operations such as 
distinguishing between normal and diflRise and/or dispersed Golgi and estimating the 
amount of Golgi in a cell. To this end, the processor may perform various operations on 
the stored digital image. For example, it may analyze the image in mamier that extracts 
5 values of one or more Golgi state parameters that correspond to a Golgi primary state and 
classifies the cell as either normal of diffuse, disperse or difiuse/disperse based upon the 
extracted values of the parameters. Altematively, or in addition, it may estimate a total 
value of the Golgi image parameter taken over at least a perinuclear region of the cell. 

Fvamples 

10 The following examples each involved generation and analysis of images obtained 

firom SK0V3 cells treated with a Lens culinaris lectin marker. The cells were incubated 
with each drug for 24 hours before fixation and staining. In each case, ring regions were 
obtained by the Golgi segmentation algorithm described above. In addition, the Golgi 
complex within those ring regions was predicted, using a CART model as described 

15 above, as either normal, difiiise, dispersed, or diffuse and dispersed. 

In a first example, SK0V3 cells that were treated with a low concentration (0.6 
nanomolar) of the anti-microtubule drag Taxol. La this example, the Golgi complex 
remams normal as determined by the model An expert independently examined the 
unages and concluded that the Golgi complex was normal. In a second example, SK0V3 

20 cells treated with a high concentration (2.6 micromolar) of microtubule-de polymerizing 
drag Nocodazole. Using the same model, the Golgi conq)lex was found to be dispersed. 
And the expert independently confirmed that the Golgi complex was in fact dispersed. 
Finally, in a third example, SKOV3 cells were treated with a drag that disrapts transport 
into and out of the Golgi complex - Brefeldin A, The drag was provided m a 

25 concentration of 79 micromolar. Li this case, the image analysis algorithm classified the 
Golgi complex as diffiise. The expert independently confirmed this. 

Conclusion 

Although the above has generally described the present invention accordmg to 
specific processes and apparatus, the present invention has a much broader range of 
30 applicabiUty. In particular, the present invention is not limited to a particular kind of cell 
component, not just the Golgi complex. Thus, in some embodiments, the techniques of 
the present invention could provide information about many different types or groups of 
cellular organelles and components of all kinds. Of course, one of ordinary skill in the art 
would recognize other variations, modifications, and alternatives. 



wo 2002/067188 



PCTAJS2002/004492 



CLAIMS 

What is claimed is: 

5 1. A method, implemented on a computing device, of analyzing an image of one or 
more cells, the method comprising: 

(a) identifying a region of the image subsuming some or all of the Gplgi complex 
of a single cell; 

(b) within said region, automatically identifying the location of the Golgi complex; 

10 and 

(c) based upon at least one of (i) the Golgi complex location within the region and 
(ii) the Golgi concentration within the region, automatically mathematically characterizmg 
the Golgi complex within the single cell, 

15 2. The method of claim 1 further comprising: 

(d) based upon the mathematical characterization, automatically classifying the 
Golgi complex in a category that at least partially distinguishes between normal Golgi and 
Golgi that is dther diffuse or disperse or both difiuse and disperse. 

20 3. The method of claim 1, wherein the image comprises multiple cells, and wherein 
identifying the region of tiie image subsuming some or all of the Golgi conq)lex comprises 
segmenting the image into regions, each subsuming at least part of an individual cell. 

4. The method of claim 3, wherein segmenting comprises identifying locations of 
25 nuclei in the cells. 

5. The method of claim 4, wherein the nuclei are identified by identifying regions of 
the image where DNA is shown to concentrate. 

30 6. The method of claim 4, further comprising within the image dilating the locations 
of the nuclei to subsume the locations of some or all of the Golgi complex in the 
individual cells. 

7. The method of claim 1, wherein the image depicts concentration versus position of 
35 one or more Golgi components within the cell. 

8. The method of claim 7, wherein the concentration of at least one Golgi component 
corresponds to intensity in the image. 
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9. The method of claim 1, wherein one or more cells in flie image were treated with a 
material that binds to a component of the Golgi complex and emits a signal having an 
intensity corresponding to its concentration. 

5 

10. The method of claim 9 wherein tiie material is a lectin or antibody that binds to the 
component of the Golgi complex. 

11. The mefliod of claim 1, wherein the mathematical characterization of the Golgi 
10 complex comprises at least one of (i) an indicator of the peakedness of a histogram of at 

least one component of the Golgi complex, (ii) the texture of the Golgi complex and (iii) 
and Ihe amoxmt of Golgi complex in the regjoa 

12. The method of claim 1, wherein the mathematical characterization of flie Golgi 
15 complex comprises the kurtosis of intensity values obtained from tiie image. 

13. The mefliod of claim 1, wherein the mafliematical characterization of the Golgi 
complex comprises a singular value decomposition of intensity values obtained from the 
image. 

20 

14. The mefliod of claim 1, wherein flie mafliematical characterization of flie Golgi 
complex comprises at least one of a mean and a standard deviation of intensity values 
obtained from the image. 

25 15. The mefliod of any one of claims 12, 13, and 14, wherein flie intensity values 
coirespond to local concentrations of a component of flie Golgi complex wifliin flie single 
cell. 

16. The mefliod of claim 2, wherein classing flie Golgi complex comprises 
30 analyzing flie mafliematical characterization wifli a biological model of flie Golgi complex. 

17. The mefliod of claim 16, wherein flie biological model is a regression model or a 
neural network. 

35 18. The mefliod of claim 16, wherein flie biological model is a classification and 
regression tree. 
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19. The method of claim 2, further comprising characterizmg a population of cells 
from the image by considering the category of Golgi complex in each ceU of the 
population. 

5 20. The method of claim 19, further comprising predicting a mechanism of action from 
the characterization of the population. 

21. A computer program product comprising a machine readable medium on which is 
provided program mstmctions for analy2ing an image of one or more cells, the program 

10 instructions comprising: 

(a) program code for identifying a region of the image subsuming some or aU of the 
Golgi complex of a single cell; 

(b) program code for automatically identifying, within said region, the location of 
the Golgi complex; and 

15 (c) program code for using at least one of (i) the Golgi complex location wifliin the 

region and (ii) the Golgi concentration within the region, to automatically mathematically 
characterize the Golgi complex within the single cell. 

22. The computer program product of claim 2 1 further comprising: 

20 (d) program code for using the mathematical characterization to automatically 

classify the Golgi complex in a category that at least partially distinguishes between 
normal Golgi and Golgi that is either diffuse or disperse or both diffuse and disperse. 

23. The computer program product of claim 1, wherein the image comprises multiple 
25 cells, and wherein the program code for identifying the region of the image subsuming 

some or all of the Golgi complex conq)rises program code for segmenting the image into 
regions, each subsuming at least part of an individual cell. 

24. The computer program product of claim 23, wherein the program code for 
30 segmenting comprises program code for identifying locations of nuclei in the cells. 

25. The computer program product of claim 24, wherein the program code for 
identifying locations of nuclei comprises program code for identifying regions of the 
image where DNA is shown to concentrate. 

35 
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26. The computer program product of claim 24, further conqjrising program code for 
dilating the locations of the nuclei within the image to subsume the locations of some or 
all of the Golgi complex in the individual cells. 

5 27. The computer program product of claim 21, wherein the image depicts 
concentration versus position of one or more Golgi components within the cell 

28. The computer program product of claim 21, wherein the mathematical 
characterization of the Golgi complex comprises at least one of (i) an indicator of the 

10 peakedness of a histogram of at least one component of the Golgj conrplex, (ii) the texture 
of the Golgi complex and (iii) and the amount of Golgi complex in the region. 

29. The computer program product of claim 21, wherein the mathematical 
characterization of the Golgi complex comprises at least one of (i) the kurtosis of intensity 

15 values obtained from the image, (b) a singular value decomposition of intensity values 
obtained from the image, (c) a mean and, (d) a standard deviation of intensity values 
obtained from the unage. 

30. The computer program product of claim 22, >«^erein the program code for 
20 classifying the Golgi complex comprises program code for analyzing the mathematical 

characterization with a biological model of the Golgi complex. 

31. The computer program product of claim 30, wherem the biological model is a 
regression model or a neural network. 

25 

32. The computer program product of claim 30, wherein the biological model is a 
classification and regression tree. 

33. The computer program product of claim 22, further comprising program code for 
30 characterizing a population of ceUs from the image by considering the category of Golgi 

complex in each cell of the population. 

34. The computer program product of claim 33, further comprising program code for 
predicting a mechanism of action from the characterization of the population. 

35 
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35. A method, implemented on a computing device, of producing a model for 
classifying cells based upon the condition of Golgi within the cells, the method 
comprising: 

(a) receiving images of a plurality of cells of a training set, wherein individual cells 
5 ofthe training set have Golgi in various states; 

(b) analyzing the images to mathematically characterize the Golgi within the 
multiple cells from the training set; and 

(c) applying a modeling technique to the mathematical characterizations obtained 
in (b) to thereby produce the model 

10 

36. The method of claim 35, further comprising segmenting at least one image of cells 
corresponding to some members of the training set to deUneate individual cells within the 
image. 

15 37. The method of claim 36, wherein the segmentation comprises dilating regions 
where the nuclei ofthe individual cells are found to reside. 

38. The method of claim 35, wherein the images depict concentration versus position 
of one of more Golgi components within the cells. 

20 

39. The method of claim 38, wherein the concentration of at least one Golgi 
component corresponds to intensity in the images. 

40. The method of claim 35, wherein the various states of Golgi in the cells of the 
25 training set are produced, at least in part, by treatment with multiple exogenous agents. 

41. The method of claim 40, wherein the exogenous agents comprise at least one of 
drugs and drug candidates. 

30 42. The method of claim 35, wherein the mathematical characterization of the Golgi 
comprises at least one of (i) an indicator of the peakedness of a histogram of at least one 
component of the Golgi, (ii) the texture of the Golgi and (iii) the amount of Golgi in the 
region. 

35 43. The method of claim 35, wherein the mathematical characterization of the Golgi 
comprises one or more of the standard deviation, the mean, the kurtosis, and a singular 
value decomposition of intensity values obtained from individual cells ofthe image. The 
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method of claim 1, wherein the modeling technique includes at least one of a neural 
network, a regression technique, and a genetic algorithm. 

44. The method of claim 35, wherein the modeUng technique comprises generating a 
5 classification and regression tree. 

45. A computer program product comprismg a machine readable mediimi on which is 
provided program instructions for producing a model for classifying cells based upon the 
condition of Golgi within the cells, the program instmctions comprising: 

10 (a) program code for receiving images of a plurality of cells of a training set, 

wherein individual cells of the training set have Golgi in various states; 

(b) program code analyzing the images to mathematically characterize the Golgi 
within the multiple cells from the training set; and 

(c) program code for ^plying a modeling technique to the mathematical 
1 5 characterizations obtained in (b) to thareby produce the model 

46. The computer program product of claim 45, further comprising program code for 
segmenting at least one image of cells corresponding to some members of the training set 
to delineate individual cells within the image. 

20 

47. The computer program product of claim 46, wherein the program code for 
segmentation comprises program code for dilating regions where the nuclei of the 
individual cells are found to reside. 

25 48. The computer program product of claim 45, wherein the mathematical 
characterization of the Golgi comprises at least one of (i) an indicator of the peakedness of 
a histogram of at least one component of the Golgi, (ii) the texture of the Golgi and (iii) 
the amount of Golgi in the region. 

30 49. The computer program product of claim 45, wherein the mathematical 
characterization of the Golgi comprises one or more of the standard deviation, the mean, 
the kurtosis, and a singular value decomposition of intensity values obtained from 
individual cells of the image. The method of claim 1, wherein the modeling technique 
includes at least one of a neural network, a regression technique^ and a genetic algorithm. 

35 

50. The computer program product of claim 45, wherein the modeling technique 
comprises program code for generating a classification and regression tree. 
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51. An apparatus for automatically analyzing an image of one or more cells, the 
apparatus comprising: 

an interface configured to receive the image of one or more cells; 
a memory for storing, at least temporarily, some or all of the image; and 
one or more processors in communication with the memory and designed or 
configured to segment the image into discrete regions, each subsmning some or all of the 
Golgi complex in single cell, and mathematically characterize the Golgi complex of single 
cells by operating on the discrete regions. 

52. The ^paratus of claim 51, wherein the one or more processors is further designed 
or configured to classify the Golgi complex based upon the mathematical characterization 
of the Golgi complex. 

15 53. The apparatus of claim 52, wherein the one or more processors classifies the Golgi 
complex in a category that at least partially distinguishes between normal Golgi and Golgi 
that is difiuse or disperse. 

54, The apparatus of claim 52, wherein the one or more processors classifies the Golgi 
20 complex by using a biological model. 

55. The apparatus of claim 54, wherein the biological model is a classification and 
regression tree. 

25 56. The apparatus of claim 51, wherein the one or more processors segment the image 
by first identifying regions of the image corresponding to nuclei of the one or more cells. 

57. The apparatus of claim 56, wherein the one or more processors perform a dilation ■ 
operation at the regions of the nuclei on the image in order to subsume the Golgi complex 
30 of each of the one or more cells. 



5 



10 



58. The apparatus of claim 51, wherein the image to be received by the interface 
depicts concentration versus position of one or more Golgi components within the cell. 

35 59. The apparatus of claim 58, wherein the one or more Golgi components correspond 
to intensity values in the image to be received by the interface. 
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60. The apparatus of claim 51, wherein the one or more cells in the image were treated 
with a material that binds to a component of the Golgi complex and emits a signal having 
an intensity corresponding to its concentration. 

5 61. The apparatus of claim 51, wherein the one or more processors mathematically 
characterize the Golgi complex by calculating at least one or (i) an indicator of the 
peakedness of a histogram of at least one component of the Golgi complex, (ii) the texture 
of the Golgi complex, and (iii) the amount of Golgi complex in the discrete regions. 

10 62. The apparatus of claim 51, wherein the one or more processors mathematically 
characterize the Golgi complex by calculating at least one of the kurtosis, the mean, the 
standard deviation, and a singular value decomposition of intensity values obtained from 
the discrete regions. 

15 63. The apparatus of claim 51, wherein the one or more processors is further designed 
or configured to characterize a population comprising the one or more cells by considering 
a category of Golgi for each of the one or more cells. 

64. The apparatus of claim 63, wherein the one or more processors is further designed 
20 or configured to predict a mechanism of action from characterizing the population. 
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